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Abstract. We consider in this paper the problem of discovering, via a tracer- 
oute algorithm, the topology of a network, whose graph is spanned by an 
infinite branching process. A subset of nodes is selected according to some 
criterion. As a measure of efficiency of the algorithm, the Steiner distance of 
the selected nodes, i.e. the size of the spanning sub-tree of these nodes, is 
investigated. For the selection of nodes, two criteria are considered: A node is 
randomly selected with a probability, which is either independent of the depth 
of the node (uniform model) or else in the depth biased model, is exponen- 
tially decaying with respect to its depth. The limiting behavior the size of the 
discovered subtree is investigated for both models. 
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1. Introduction 

In the past ten years, the Internet has known an extraordinary expansion and 
still experiences today a sustained growth. The counterpart of this success is that 
the different autonomous systems composing the global Internet have been indepen- 
dently developed by different operators. This raises some issue since the Internet is 
by construction a flat network, where the different components are interdependent 
in terms of connectivity availability, security, quality of service etc. It thus turns 
out that the knowledge of the physical layout of a network is of prime interest for 
network operators. The physical topology of a component of the Internet is in gen- 
eral very difficult to describe. To establish a representation of the whole or a part 
of the Internet, some topology exploration methods have to be devised. Various 
topology discovery experiments have been initiated by different organizations in 
order to infer the topology of the global Internet, notably the Skitter project by 



Date: June 7, 2008. 

Key words and phrases. Traceroute Algorithm. Steiner Distance. Branching Processes. Oscil- 
lating Behavior. Asymptotic Expansions. 

1 



2 



FABRICE GUILLEMIN AND PHILIPPE ROBERT 



CAIDA [3], the DIMES project [14] and many other initiatives. The method gen- 
erally proposed for analyzing the topology of a network is based on the traceroute 
facility offered by routers. Roughly speaking, a traceroute procedure consists of 
sending traceroute messages between hosts as follows: 



Traceroute Algorithm 

If H and G are hosts participating in the topology discovery experiment, H sends 
to G a traceroute message so that all the hosts/routers on the path (H,G) are 
identified. 



The purpose of this paper is to investigate the efficiency of the traceroute algo- 
rithm. While a large number of experimental papers are available in the technical 
literature on the analysis of the topology of the Internet, a very few studies pro- 
vide analytical insight into the efficiency of these topology discovery methods; see 
Vespignani et al. [5] for a discussion and Azzana et al. [2] for an analysis in the case 
of specific deterministic trees. 

In this paper, a more realistic model is proposed to include some randomness 
in the degree of the nodes of the graph representing the topology of a network. 
One specifically considers a network with a random tree architecture spanned by 
a Galton- Watson branching process. We shall restrict the analysis to the case 
of offspring distributions, which have a finite second momenmt. This notably pre- 
cludes the case of power law distributions with infinite second moments, typically 
distributions G such that P(G > n) ~ CrT a with a £ (0, 2). 

The Internet graph is definitely not a tree, since many studies (see the Skitter 
project) show that there is a core of highly connected routers. Nevertheless, some 
components of the Internet have a topology close to a tree structure. This is notably 
the case of access or collect networks, which play the role of capillarity networks 
in charge of collecting and distributing traffic between customers and the core of 
the Internet. This latter component is not critical for the problem we study in 
this paper since core routers are easy to discover by traceroute procedures. This is 
why we focus on collect networks, which can be represented by a tree architecture, 
spanned by a branching process. In addition, to get more insight into the topology 
discovery process in the case of a large network, it is assumed that the underlying 
branching process does not terminate with probability 1; in particular the depth of 
the tree is infinite. 

The discovery process is as follows: a random number of nodes are selected 
among the nodes of the tree. After the selected nodes have performed the traceroute 
algorithm, the set of the nodes discovered is the spanning tree of the selected nodes. 
The performance criterion used in this paper is simply the size of this sub-tree. In 
graph theory it is known as the Steiner distance of the selected nodes (with the 
slight difference that the selected nodes are not counted). It has been the subject of 
a recent interest by Mahmoud and Neininger [9] and Christophi and Mahmoud [4] 
which considered the asymptotic behavior of the distance between two random 
nodes of the tree. Panholzer [T2], Panholzer and Prodinger [T3] proved central 
limit theorems when multiple points are considered. The asymptotics investigated 
in these papers concern the size of the random tree. In our paper, we will study 
two situations: when the size of the tree and the number of selected nodes go to 
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infinity and also when the infinite tree is fixed and the number of selected nodes 
grows. See Panholzer [12j for a thorough discussion of the literature in this domain. 

Two stochastic models for selecting the nodes in the network are considered. 
In the first model, the uniform model, we adopt the point of view of an external 
observer to the network; a set of nodes is chosen at random and a traceroute 
algorithm is performed. In the second model, the depth biased model, the observer 
is located at one node (the root node) and it chooses more likely nodes not too far 
away. As it will be seen, in the uniform model , the selected nodes are basically in 
the "bottom" of the tree where most of the nodes are, while in the second model 
they are more concentrated at the "top" of the tree. 

In the first model, referred to as the uniform model, nodes whose depth is less 
than N > are randomly chosen with probability 1 — exp(— A) for some A > 
independently of the position of the node in the tree. The quantity analyzed here 
is the ratio pat (A) of the mean size E(i?/v) of the sub-tree discovered and the mean 
number E(T/v) of nodes of the tree whose depth is less than N. The quantity /9/v(A) 
denotes the fraction of the tree discovered. The asymptotic results of this paper 
first determine the limit p(A) of pat(A) as iV tends to infinity. In a second step, 
the asymptotic behavior of p(A) for A — > is investigated. This last point gives an 
indication of the efficiency of the algorithm when only a few nodes are selected in 
the topology discovery experiment. 

For the uniform model, it is shown in Theorem[T]that, for small A, the exploration 
rate p(A)/A is equivalent to log m A where m is the mean value of the offspring 
distribution of a node, so that at the first order the algorithm is very efficient. A 
second order analysis, Proposition [2 reveals that the standard deviation of the 
size of the discovered tree scales with the mean size of the tree, except when the 
offspring distribution is deterministic. This latter case is degenerate in the sense 
that the standard deviation is negligible when compared to the mean value. 

In the second model, referred to as the depth biased model, the probability of 
selecting a node depends on its depth in the tree so that the mean number of 
selected nodes at depth n is a n for some a > 0. It is shown in Theorem [3] that 
the ratio of the average of the size R(a) of the sub-tree discovered and the average 
number of selected nodes is equivalent to 1/(1 — a). 

The paper is organized as follows: In Section [2j the models for the selection 
of the nodes of the tree are introduced. The uniform model is investigated in 
Section 2] and the depth biased model in Section \5\ The main ingredients for the 
analysis of these models are Kesten-Stigum Theorem and some results on the rates 
of convergence for Galton- Watson branching processes and a general limit theorem 
proved in Section [3] 

Acknowledgments. The authors wish to thank two anonymous referees for their 
work, their detailed comments have helped us a lot to improve and correct mistakes 
in the first version of the paper. 

2. Problem Formulation 

Throughout this paper, we consider a Galton- Watson branching process, whose 
graph is a tree denoted by T. Each element of the nth generation (or nth level) gives 
birth to G nodes at the (n + l)th generation independently of the other elements 
of the nth level, where the offspring G is some integrable random variable. (See 
Athreya and Ney PQ and Lyons and Peres [7] for an introduction to random trees.) 
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It is assumed that P(G=0)=0 and P(G > 2) > 0, in particular the tree is 
supercritical, i.e. to = E(G) > 1. For n > 0, the variable Z n denotes the number 
of nodes at level n, in particular Zq = 1. For 1 < £ < Z n , a node of the tree 
can be represented as a pair (n,£), where n is its generation and £ its rank within 
the generation. (For notational conventions, see Neveu [11] for example.) Let 
denote the sub-tree of T with depth less than or equal to k and with root at node 
(n,£). The size of TP"' is denoted by TJ?' . When (n,£) is the root node, i.e. 
(n, £) — (0, 1), the upper index (0, 1) is omitted. With the above notation, one gets 
easily that for all N > 1 and n = 1 , . . . , N 

n-i z„ 

(1) T N = Y. Z ^H T N-n- 

Let us consider a counting measure M on the tree representing the distribution 
of the points selected in the tree: For a subset A of the nodes of the tree, N(A) 
denotes the total number of points in A. By selecting nodes, a sub-tree from T is 
obtained through the traceroute algorithm; this sub-tree is referred to as sampled 
tree. See Figure [1] 




Figure 1 . Traceroute Algorithm. 

To complete the description of the problem, it remains to specify how the nodes 
of the original tree are selected. In the following, we shall consider two selection 
criteria: 

Uniform model: Nodes are chosen at random on all the nodes of the tree 
whose depth is less than or equal to N, N being a fixed integer. A node 
is selected with probability 1 — exp(— A) independently of his depth in the 
tree. The mean number of nodes involved in the discovery experiment is 
then (1 — exp(— \))(m N+1 — l)/(m — 1). (Recall that the mean size of the 
nth generation is to™, n > 0, where m = E(G), the mean of the offspring 
variable G.) 

To investigate the topology discovery process, we shall consider for a 
fixed N > the N first levels of the original tree T and count the number 
of nodes which are discovered, given by 

N Ziv-„ 

( 2 ) fi JV = X/ IJ 1 {A^(T„ JV — ^)#0}- 

n=0 1=1 
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In the following, we shall be particularly interested in the quantity 

E(R N ) 



(3) P n(X) 



i.e., the ratio of the mean number of discovered nodes to the mean number 
of nodes in the tree, when the analysis is restricted to the N first levels 
of the tree. Then the behavior of this ratio when the number N of levels 
tends to infinity is investigated. 
Depth biased model: Nodes at given level n are selected with probability 
1 — exp[— (a/m) n \ for some a G (0, 1). The mean number of nodes selected 
at level n is m"(l — exp[— (a/m)™]) ~ a n and therefore is exponentially 
decreasing with respect to the depth. The rational behind that is the fact 
that, for this model, the traceroute procedure will rarely select nodes "far 
away" from the root node, in contrary to the uniform case where geometric 
aspects are completely ignored for the selections of the hosts. 

By denoting by R(a) the total number of nodes discovered, the efficiency 
of the traceroute algorithm is measured in this case through the ratio of 
the mean K(R(a)) to the average number of selected nodes. The limiting 
behavior when the average number of selected nodes becomes large, i.e. 
when a /* 1, is investigated. 

Additionally it is assumed that the root node of the tree is always selected; it is not 
difficult to show that for both models described above, the root node belongs to 
the sample tree with a very high probability and then the above assumption is not 
really restrictive. This implies that a node (n, £) of the tree T at level n belongs 
to the sampled tree whenever Af(T^'_ n ) is not 0. In other words, a node of the 
original tree belongs to the discovered tree if at least one of his descendants has 
been selected. In the following, we shall use the following notation: for a subtree 
q-N-n,£ r00 t ec j a t a vertex (N — n, £) of the (N — n)th generation of the tree T and 
with depth n, the quantity P(A/'(7^ V ~™' £ ) 7^ 0) is the probability that a least one 
vertex of the subtree T^~ n ' is marked and (N — n, £) G T. 

Before proceeding to the analysis of the topology discovery process, we prove in 
the next section a technical result, which is important in the analysis of the speed 
of the exploration process. 



3. A Convergence Result 

To prove asymptotic expansions in the following sections, the following propo- 
sition will repeatedly be used. Its proof is based on integral representations and 
Fubini's Theorem instead of complex analysis techniques as it is usually the case in 
the context of harmonic series. See Robert [TH] for a presentation of these methods. 

Proposition 1. Let V be a positive random variable with E(V^ 2 ) < +00 and h be 
a non-negative twice differentiate function on R+ such that h(0) = 0. In addition, 
it is assumed that the function h' is integrable with h'(0) ^ and that there exists 
some constant K > such that \h r/ (x)\ < K for all x £ [0, 00). 
The function ^(h)(x) defined by 

+00 . 

(4) *(ft)W = E- E (M^m fl )). *>0, 

£ — ' m 

n=0 
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is such that 



«-oa?log m (l/a!) 



Proof. Since h is non-negative and \h'\ integrable with respect to Lebesgue measure 
on M_|_, Fubini's Theorem applied twice shows that 'J (ft) can be expressed as 
+00 / +00 \ 

*(h)(x) = ]T — E (ft (xVm n )) = E £ — h (xVm n ) 

n=0 \n=0 / 

/+00 „ +co \ 

= E \Y1 / h '{ u )^-{u<xVm^} du 

( + oo +00 J \ 
^ h'(u) Y^—l { u< x v m ^} duj . 

The function \P(ft) is thus well defined. 

Since h'(Q) > 0, Fatou's Lemma applied successively gives the relation 

liminf ^ A ' > > liminf E — - - 

x—>o x — ' x^o m n \ x I 

n=0 v ' 

> E^ E fHminf = g (m _ i)E(V)fc'(0) = +00, 



therefore the ratio ^f(h)(x)/x diverges as x — ► 0. 
By using representation ([5]) of ^(ft), we have 

(m - l)*(ft)(a;) =mE[ [ h'(u) du] 



+ E| / — n ? /^m fo'(") cfat I + E f / — n r - r -rp-h' (u) du 

where |_2/J is the integer part of y 6 R. One first shows that only the central term 
of the right hand side plays a role in the asymptotic behavior of 'I' (ft) at the first 
order. 

For the first term, note that, if ||ft"||oo is the norm of ft", 



-j-EM h'{u)du 



< ^E t f (ft'(0) + ttll^'Hoo) d«) < h'(0)E(V) + |E(V 2 )||ft"|| c 



For u > V, one has 
and hence, 



-E / — n , , . . |ft (u)\ du <m \h(u)\du. 
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By gathering these estimations, it follows that the following equivalence 
/ f V 1 




j; m Nm(''/ iy )J 



h'(u) du 



v / y m o° gm (^v) } ^H du 



xV 



u 



holds as x — » 0, with {y} = j/ — [y\ , the fractional value of y e M. The above 
equivalence can be rewritten as 



*W(*) F /V /• V „no g „fwxv)> ^(")-^(0) 



rV 

x y u y 

/ /-V m {logJ«/^)} \ 

+ ft'(0)E [ V / du 

Due to the boundedness of h" and the integrability of V 2 , the first term in the right 
hand side of the above equation is bounded as x goes to 0. Hence, only the second 
term has to be considered. For x < 1, we have 



I. 



^ TO {l°g m («/zV)} rl/x m {log m («)} 

du = du 



xV « 

fc+i 

E 

fc>0:m fc <l/2; 



,{log m («)} 



■dtt + 0(l) = (m-l)L-log Bl (a ! )J+0(l) 



u 

and the result follows. □ 



Asymptotic behavior of algorithms with an underlying tree structure has been 
extensively investigated, see Flajolet et al. [6], Mohamed and Robert [10] and Mah- 
moud [S] for a general presentation. By using the terminology of Flajolet et al. [5], 
for non-negative sequences (A n ) and (fi n ), a series like 

(6) G(x) = ^2 Kg(Hnx), 

n>0 

for some function g is defined as an harmonic sum. Because of the integration of 
the random variable V and given that one wants the weakest assumptions on this 
random variable, series (JH) could be seen as a special case of harmonic sums. The 
fact that the sequences (A„) and (/i„) are specific in Expression Q is not a real 
restriction, see Robert [T5] , 

Flajolet et al. [B] derives the asymptotic expansion of G{x) when x goes to 
or +oo by using Mellin transform techniques. For s € C, if h*(s) is the Mcllin 
transform of h, i.e. for s in some vertical strip of C, 

r+oo 

h*(s) = / h(x) x s - x dx, 



it is easy to check that the Mellin transform of ^(h) is given by 

Following the methods of Flajolet et al. [6], to derive the asymptotic behavior of 
^(h)(x) as x goes to infinity, one has to identify the first singularity of &(h)* on 
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the right of the maximal vertical strip where it is defined. In particular, some 
conditions on the finiteness of some fractional moments of the random variable V 
have to be assumed (as well as growth conditions on h*). From this point of view, 
our approach is minimal since only the finiteness of K(V 2 ) and differentiability 
conditions on h are assumed. It turns out that it is important as it will be seen in 
the following sections, since in practice little is known on the fractional moments 
of the corresponding variable V. 

4. The Exploration Rate in the Uniform Model 

In this section, nodes are selected at random with uniform probability in the 
tree with depth less than TV. The variable Rn is the size of the underlying sub- 
tree (or sampled tree) containing the selected nodes. The asymptotic behavior of 
Pn(^) — E(i?jv)/E(T/v), the fraction of discovered nodes, when N tends to infinity 
is investigated. In the second part of this section, the ratio var (Rn) /E(Tjv) is 
analyzed. 

4.1. First Order Asymptotics. In the uniform case, the limiting behavior of the 
ratio /o/v(A) when N tends to infinity is given by the following result. 

Theorem 1. The ratio of the average size Rn of the sampled tree to the total 
average size of the tree E(T/v) satisfies the relation 

(7) p(A) d M- to MA) =£ ^ fl-E (exp f-A][> 

n=0 \ V \ i=0 

// additionally the condition E (G 2 ) < +oo holds then 

(8) Urn - PW j- = 1. 
yJ A-oAlog m (l/A) 

Relation © shows that the rate of increase of the discovery process is infinite 
near the origin. This implies that with only a few selected nodes one has the 
impression of rapidly discovering the whole network. 

Proof. By conditioning on the tree, the conditional probability that node (N — n,£) 
does not belong to the sampled tree is 



P (AT (T n N - n ' £ ) # 0| r) = 1 - exp (—AT, 



N—n,. 



By summing-up these relations, one obtains that the expected value of Rn, i.e., 
the average number of nodes in the sampled tree, is given by 

N 

E{R N ) = J2 E(^Ar-n) (1 — E (exp (— AT n ))) 

n=0 

= E mN ~ n [ 1 ~ E ( cxp ( _A E Zi 

n=0 V V V i=0 

The limit when N — ► oo of the ratio pat (A) is then given by 



P(A) d = lira p N {\) = 

N— »+oo 
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since E(Tat) ~ m N+1 /(m — 1) for large N. This proves the first equality stated in 
Theorem [TJ 

We now study the behavior of p(A) when A goes to 0. Since E(G 2 ) < +oo, 
Kesten-Stigum's Theorem ensures the existence of a random variable W such that 
W(W > 0) = 1 (because of the assumption on the distribution of G) and E(W / ) = 1 
(See Lyons and Peres [7]) and that, almost surely, 



(9) 

Let us define 



, . Z n 
lim — 



W. 



+oo 1 

dcf. m — l 



/(A) d = f - E 



,n+l 



l — E exp —XW 



m — 1 



Then, 



(10) 



|p(A)-/(A)| 



< 



A 

+00 

E 

n=0 



m — 1 
Am" +1 



E I exp I -X^TZi 



exp -AW- 



m — 1 



Since is integrable, Lebesgue's dominated convergence Theorem gives that 



(ff) HmiE^expf-A^^ 



exp -AW- 



m — 1 



• E z < ir- 



1 



si=0 



to — 1 



We have 
1 

m" +1 A 



exp 



exp —XW- 



i n+1 - 1 
m — 1 



n 



i=0 



From Athreya and Ney [TJ Theorem f , page 54], for n > 1, there exists a sequence 
(W*) of i.i.d. random variables with the same distribution as W such that 



(12) 



Z n -m n W = ^2(1-W i ). 



By using Cauchy-Shwartz's Inequality, we obtain 



E (\Z n - m n W\) | < ^/E((Z n ~m"Wy) 



(13) 



Vai(l-W)y/E(Z n ) 
Var(l - W)m n/2 . 
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From the above inequality, we deduce that 
1 



m n + l \ 



E (^exp (~\it Z *)) - E ( CX P (-^ "C-l 1 

Var(l - W) 1 



< 



1 m (n+l)/2 ■ 

Relation (fTT|) and Lebesgue's Theorem then imply that 

ii m ^-^=o. 

A^O A 

Hence, up to an expression which is of the order of o(A), the behavior at of p(X) 
is equivalent to the behavior of /(A) as A becomes small. 

By using Proposition[I] we have by taking h(u) = l — e~ u and V = Win/ (m— 1), 

+oo 

E ^TT (1 - E ( ex P (-*Vm n ))) = ~ x\og m {l/x) 

71=0 

as x — * 0. To conclude the proof, we note that 

to *(*)(*) -/(«) =0 

z^o a;log m x 

and the result follows. □ 

4.2. Second Order Properties. The results obtained in the previous section 
show that the size of the sampled tree is of the same order of magnitude as the 
original tree when the probability of selecting a node is fixed. When this probability 
is very small (i.e., for small A), the speed of the discovery process is even very fast. 
In this section, we evaluate the second moment of the random variable Rjq in order 
to estimate the dispersion of the size of the sampled tree around the mean value. 

In the rest of this section, we use the following notation: If (n,£) and (n',£') 
are two nodes of the tree, the relation (n',1') < (n,£) indicates that the nodes are 
distinct and that {n',£') is a node of the sub-tree whose root is (n,£). 

Proposition 2 (Asymptotic behavior of the variance). When the size N of the 
original tree goes to infinity, the variance of the size of the sampled tree is such that 

(1) If the random variable G is not deterministic andM(G 2 ) < +oo, then 

Var(Ejv) _ Var(G) ^ 2 
n'~oo E(T N y 

where p(A) is defined by Equation jJJ . 

(2) If G = m almost surely, then 

(15) $\\) d £ lim = l!llyI[ E ( e ^)(l-Ert 
v ; H2 y ' w^oo E(Tjv) m ^ to" L v M v " 

n-1 

+2 ^ E (e-^*-*- 1 Z n _ k E (e-^)^-"" 1 E (e" AT * (l - e~ AT *))) 

k=0 

where T n = (m n+1 — l)/(m — 1). 



( 14 ) P2 ( A ) = J lm „ Twtrr_ A2 = — —Py X > 
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It is worth noting that the case of a deterministic offspring distribution is de- 
generate in the sense that the standard deviation of the size of the tree discovered 
by means of the traceroute algorithm does not scale with the size of the tree (and 
the discovered tree). The coefficient of variation of the random variables Rn tends 
to when TV goes to infinity. 

Proof. Using Representation ([2|) for the size of the sampled tree, one obtains by 
conditioning on the tree the relation 



Rn — E(-Rjv) — An.j + An,2 + An,3, 



where 



JV Zjv-„ 

n=0 t=\ 
N 

A N , 2 = £(£ N-n — E(ZN-n))W (Af(T n ) ^ 0) , 
n=0 

N Z N _„ 

A ^ = ££(!" exp(-AT r f -"-0 - P(jV(T„) f 0)) 

with = l{ A T(r JV_ ™'*)^o} — 0- ~ ex P(~AT,f _n '^)). Note that if distinct nodes 
(n, I) and (n',l') cannot be compared with the relation "<" then, conditionally 
on the tree T, the corresponding random variables and A^, are centered and 
independent. In addition, note that Ana = Rn — E(Rn \ T). 

To study the variance of the random variable Rn, we separately consider the 
second moments of the terms Ajv,i, A/v,2 and An,3- Of course, the terms An,2 and 
An, 3 are non null if and only if the variable G is not deterministic. 

The second moment of An.i- It is shown that the second moment of An,i is 
of the order of m N . By using the independence in the selection of nodes in the 
tree and the fact that the random variables A^ are centered conditionally on T, we 
have the identity 

e(a 2 na i t) = Yl Var ( A « I T ) + 2 £ E ( A 'X' I T 

(n,i)eT N (n,e),(n',e')£T N 

(«' ,£')<(n,i) 

Conditioning on the state of the tree, when (n',£') < (n,£), one has the identity 

E (AfX' I r) = exp (-AT»-^) (l - exp (-AT^')) . 

By symmetry, the above computations yield the following relation for the second 
moment E(^4^ x ) 

JV 

U N d = E(A% A ) - Yl E(^-„)Var(A I \) 

n=0 

/ 



2E 



£ E{Z N . n ) exp (-XT?-** 1 ) i 1 - ex P (" 

n=0 (N-n ,l')eT N 

\ (N-n' ,l')<(N-n,l) 



yrpN-n',1' 
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where Var(A n ) is the variance of the random variable 1{jV(t„)^o} — ^ > (^(%i) 7^ 0). 

For two nodes of the tree such that (N—n',1 1 ) < (N—n, 1), Equation fl} gives 
the relation 

n— n f — 1 Z n _ n i 
dist. \ -> ~ . \ -> r^N-n'J' 



E E T n 



k=0 



where (Zk, k > 0) denotes another independent Galton- Watson process independent 
of (Z n , n > 0) with the same offspring distribution. By using this relation, we have 



E 



exp (— AT„ 



N-n,l\ 



E 



-1 / \rr>N— n',£' 

1 - exp -AT ' 



V 



{N-n',l')eT N 
(N-n' ,l')<(N-n,l) 



E EE e X p(-AT^)(l-exp(-ATj-»'^ 



n—n' — l 



^exp -A Z A E ( V n-n>) , 



i n'=0 



where 



£ exp -A E ^- nV " (I" -P -AT* 



-,N-n'J' 



e"=i 



By using the independence of the different trees T n , n ' e for £ = 1, . . . , Z n - n >, we 
have 

E(V^_ n ' I Zo, . . . , Z n - n '-i) 
= E (Z n . n , (E (e- AT "')) Z "-"' _1 I ^0, ... , ^-n'-i) IE (e- AT "' (l - e^-')) . 
It follows that by using the above expression for Un, one obtains 



N 



n-1 



n—n—1 

U N = 2J2 E(Z W _„) E E ( CX P ( - A E Zfe 

fc=0 



71=0 



Z„_„/E ( exp (—AT 1 ,,/ 



E exp (-AT„0 (1 - exp (— AT n /)) 



Dividing by E(Tjv), we have 
[/at _ 2(m-l) 



E(Tjv) m 



^ N / / n— n'— 1 \ 

1/m" ^ m™ ^ E I 6XP I ~ A ^ Zfc j 

' n=0 n'=0 V V k=0 J 



Z n - n ,E ( exp (-AT 1 ,,/) 



Z„ „/-l 



E exp (-AT„0 (1 - exp (— AT n /)) ; 
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By letting N go to infinity, we finally obtain the relation for the second moment of 
the random variable An i 



(16) lim 



m-1 



n^oo E(Tjv) m 



Ei[E(^")(l^(e-")) 



k=0 

The second moment of Am, 2- We have 

N ,„ ^ lrT ,w, m / _,\7, 



A 



N,2 



111 



N 



E 

n=0 



(Z N _ n -E(Z N _ n )) (l-E(e- AJ -)) 



m 



N-n 



m 1. 



N 

E 



If ||-ff||2 = -\/E(iJ 2 ) for some random variable H, then we have 



.4 



N 



N,2 



(1 -E(e~ Ai ")) 



n=0 



N 

^E 

2 n=0 



w 



z 



N-n 



,N-n 



(l-E(e~ AT ")) 



(1 -E(e- AT ")) 



where W is defined by Equation |9]). Athreya and Ney [U Theorem 2, page 9] 
gives that the sequence (||W — Z n /m n \\ 2 ) converges to 0. This implies 

2 



(17) 



lim 



E(^, 2 ) (m - l)Var(G) ^ (1 - E(e~ AT ™ )) 



7V^+oo E(Tj\r) 2 TO 3 

since E ((1 — W) 2 ) = Var(G)/m(m - 1). 
4.3. Second moment of An 3. Clearly 



E 



m" 



1-4 



N,3 2 



< 



E 

n=0 



Zn-ti 



£ exp(-AT r f - E(cxp(-AT n )) 



f=i 



and since conditionally on Zn-hi the random variables exp(— XT 7 ^~ n,e ) for £ 

1 Zn-u are independent and identically distributed with mean E(exp(— AT„)), 

we then have 

2 



£ expt-AT^-^) - E(exp(-AT„)) 



e=i 

It follows that 

and then 
(18) 



E(Z Ar _„)Var(exp(-AT„)) 



Italia < 



<mP- n l 

m (N+l)/2 



m — 1 



lim sup ■ 



1(^,3) 



N- 



-,N 



< 



(yfiH- l) 2 ' 
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Since R N - E(R N ) = A Nt i + A N , 2 + An,3, Relations (US]), dTTJ) and QH) then 
imply that 

(1) When G is non-deterministic, the expression A^ y 2 dominates in Rn—~E(Rn) 
so that Var(i?jv)/E(7Ar) 2 is converging to the right hand side of Equa- 
tion (fT7|). 

(2) If G = m, the term A^^ vanishes so that Var(i?jv)/E(T/v) is converging to 
the right hand side of Equation (|16p . 

Equations (fH| and ([15]) are established. □ 

As for the first moment of Rn-, we turn now to the analysis of the behavior of 
of the second order characteristics defined by Equations (fTi)) and (| 1 5[) when A is in 
the neighborhood of 0. For the non deterministic case, we have from Proposition [T] 

p«(A) _ Var(G) 



a^o (Alog m (l/A)) 2 (m 2 -m)' 

In Proposition^ the expression of p^p (A) is defined a priori only for a deterministic 
offspring distribution, but can be extended to any offspring distribution by using 
the right hand side of Equation (|15p. In the following, we study the behavior of 
p 2 (A) for an arbitrary offspring distribution. 

(2) 

Lemma 1 (Asymptotic Behavior of X^p 2 (A) at 0). Provided that the random 
variable G has a finite second moment, the function p 2 2 ^ (A) defined by Equation (|15|) 
is such that 

(19) lim u f (X l 2 = 1. 

a\o A(log m A) 2 

Proof. Define 

/a (A) d = ^2 ( E (e~ AT ") (l - E (e~ AT ™))) 



m 

n=l 



and 

dcf. 



E ^ E E (e" AT — ^-*E (e-^f— 1 ) E (e"^ (l - 



m 

n=l k=0 



Equation (J^SJ) gives that mp { 2 ] {X) = 2/ b (A) + f a (A) . 

Asymptotic behavior of f a . With similar arguments as in the proof of Theorem [1] 
the asymptotic behavior of / a (A) when A goes to is equivalent to the asymptotic 
behavior of 

/ ( Wm n+1 \\ ( ( ( Wm n+1 
E — (-p (-A— p j J (l - E (cxp (-A-— r 
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If W\ and W2 are two independent random variables with the same distribution as 
W, the above series can be rewritten as 



+ OO 



E exp -A — — 1 - E exp -A 

m \ \ m — 1 J \ \ \ m — 1 

= (m - 1) E ( — E(ft(A(Wi + W 2 )m"/(m - 1))) - E(/i(AWim"/(m - 1))) J , 

n=l ^ m ' 

with = 1 — e~ u . Consequently, 

(20) lim { a } X) - = m 

a^o -Alog m A 

by Proposition [1] 

Asymptotic behavior of /{,. Let us fix some e > and assume that A < e. The 
function /&(A) can be rewritten as 

Llog ro (e/A)J _ 00 

(2D AW= E ^^ A ) + E ^^ A ) 

n=l n=Llog m (e/A)J+l 



where 

n-l 



S(n; A) = ^ E ( e - AT — Z„_ fe E (e-^)*-* -1 ) E (e- ATfc (l - e - ATfc )) 

fc=0 

Since for x > 0, e _:E (l — e _2: ) < a;, we easily deduce that for all n > 1 

n-l 

S(n;A) < ^E(Z„_ fc )E(AT fe ) < 

and then 



n\m n+1 
m — 1 

fe=0 



Llog m (£/A)J _ 

E ^5(n;A) < |Alog m ( £ /A)(log m ( £ /A) + l). 

n=l 

The second term in the right hand side of Equation (f2"Tj) can be written as 
(22) E ^5(Llog m ( e /A)j+l;A) 

OO 1 

+ E ^(S(n;A)-S(Llog m ( £ /A)j +1;A)). 

«=Uog m (e/A)J + l m 

By using the fact that for x > and a > 0, xe~ ax < 1/a, we get that 

1 Llog m (e/A)j 

5(Llog m ( £ /A)j + l;A)<^-^ g 

The relation E (exp (-AT fc )) > exp (— AE(Tfc)) > exp(-me/(m - 1)) holds by 
Jensen's Inequality under the condition that k < |_log m (e / A) J . In addition, 

/ v 2 
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where a 2 is the variance of the random variable G, so that for k < |_log m (e/A)J 
.„„, AE(T, 2 ) \m k m k fa 2 \ e fa 2 

(23) — y+ < - — — — j-— — - + 1 < - — — — - + 1 



(Tfc) (m — l)(mm k — 1) \m — 1 J (m — l) 2 \m — 1 
Since for x > 0, e~ x < 1 — x + x 2 /2, we have 

E ( (XT ) 2 } { 

E(e- XTk ) < 1 - AE(T fe ) + U fc; ; < 1 - AE(T fe ) ( 1 

and then, for k < |log m (e/A)J, 

AE(T fc ) ^ AE(T fe ) 



(m — l) 2 \m — 1 



(24) ,„„ p " , < 



•logE(e- AT *) " 1-E(e- AT '=) 

*( 1 -^(^T + 1 )r*"<" 



as long as 



<(m-l)V(^n + l]^ ei . 



It follows that for e < ex, 

S(Llog m ( £ /A)j + 1; A) < (1 + log m (e/A)) e "-/(" l - 1 )«(e). 
and therefore, 

£ Hi_S(Llog m ( e /A)j + 1; A) < V^A)^"- 11 ^ 
* — ' m e 

»=Ll°g m (e/A)J + l 

which is o(A(logA) 2 ) when A — > 0. In addition, the second term in the right hand 
side of Equation (|22[) can be rewritten as 



OO OO 

E ^ E ^ E (e-^-Z^e-^)^) E (e-^ (l - e"^)) . 
fc =Ll°g m (e/A)J + i n=l 

We first note that 

E^K(e- AT - 1 W^)^- l ) = 

Uog m (e/A)J 

£ -^E(e---Z„E (e -^)^-) 



m" 

n=l 



n=l 



oo 

V -!-E(e- AT »- 1 Z n E( e -^)^- 1 ) 



m 

n=Llog m (e/A)J+l 



The first term in the right hand side of the above equation is less than or equal to 
the quantity log m (e/A) since E(Z„) = m". The second term can be upper bounded 
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as 



m 

n=Llog m (e/A)J+l 

oo 

' m n (m — lie 

«=Llog m (e/A)j + l V ; 

where we have used the fact that Tk > 1 for all k > and ie" Al < l/(eA) for all 
a; > 0. It follows that the second term in the right hand side of Equation (|22[) is 
upper bounded by the quantity 

7 ( l0Sm(£/A) + 

which is o(A(log m A) 2 ) when A — > 0. 

By using the above inequalities, we come up with the conclusion that for every 
£ > 0, 

25 hm sup — — < — . 

A ^o A(log m X) 1 2 

For establishing a lower bound for /&(A), we introduce the size-biased Galton- 
Watson branching process. The sequence of random variables (Z n /m n ) being a 
positive martingale, it induces a probability distribution P such that, for any n > 
1 and any random variable Y measurable with respect to the random variables 
Z\, ■ ■ ■ , Z n , 

Z n 



s 



YdP = E Y 



It is known, see Lyons and Peres j7], that under the probability P, the sequence 
(Z n ) as the same distribution as a branching process with immigration (Z n ) where 
the number of children has the same distribution as G and the number of new 
immigrants is distributed as G such that P(G = n) = n¥(G = n)/m. If Zq = 1, it 
is easy to check that 

m(z n ) =m"+ ^—L e(G 2 ). 

V / mym — \) 

If T n = Zq + Z\ + • • • + Z n , we have by Jensen inequality 

... J. j e -AT„_ fc]E C e -AT fc 



> exp (-A(l + E(T fe ))E(f„_ fc )) 

/ Am / „ 7, m" +1 \ / o 2 

> exp m""* + 1 + - 

\ m — 1 \ (to — 1) J \ to 



since 



£(f w ) < ^LL (i + * 

m — 1 V to 
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where 52 = E(G 2 ). In addition, by using the fact that e x (l — e x ) > x — 2x 2 holds 
for x > 0, we have 

Llog m (e/A)j Ll°g m (e/A)J 

m — 1 



J2 ^S(n;X)> J2 ( m - X ) 
^— ' m n z — ' 

i—l n— 1 

g ^ -p {~i K' + (> + S) ) E («i - 2 <-*> 2 > 



n-l 

X 

fc=0 



, em / m \ / o? 
> exp 1 + 1 + - 

[m — 1) \ (m — 1) J \ m 

l>g m (e/A)J n-l 



n=l fc=o 
By using Inequality (f2U)) and Definition (|24p . we have, for e < e 1; 

Liog^e/A)j <^ XE(T k ) ( AE(T 2 ) \ A L '° g ^ /A)J m *+i - l 

Z-j ( m X ) 2^ m fc I 1 E(Tfe) J ~ K (e) ^ ^ m k 

n=l k=0 v w n=l fc=0 

Since 

Uo g gA)J g = m + + ^Llog m (s/A)j 

t-^ mr 2 m — 1 

n=l fc=o 

1 

- 1 



(m- l) 2 VmL lo g m (^A)J 

and since we already know that the second term in the right hand side of Equa- 
tion (f2"Tj) is o(A(log m (A)) 2 when A — + 0, we then deduce that for all e G (0, £1) 

liminf , ,/ b ^ - xo > — — exp ( —-, ( 1 + r | (l + — 

a^o A(log m A) 2 - 2k(e) p V (m - 1) V (m - 1)/ V m 

and hence, 

(26) hmmf — -— r > — . 

Alog m A 2 2 



Combining Equations (|20|). (|25[) and (|26[). Equation (19]) follows. □ 
Proposition 3. The functions p^\X) and p^\X) are such that 



(28) limT^W = 1 



^ 2) (A) 
a-»o A(log m A) 2 

where G is the random variable describing the offspring of a node. 

From Theorem [TJ we observe that the size of the sampled tree scales with the 
size of the original tree. The same phenomenon is true for the squared coefficient of 
variation of the size of the sampled tree if an only if the offspring distribution is not 
deterministic as shown by Proposition O In the case of a deterministic offspring 
distribution, when A — ► 0, the squared coefficient of variation is approximately equal 
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to 1/(AE(Tjv)) for large N. The quantity XE(Tn) is precisely the mean number 
of selected points. This indicates that the distribution of the random variable Rn 
is concentrated around its mean value. There is almost no randomness in the 
discovered tree. 



5. The Depth Biased Model 

In this section, it is assumed that conditionally on the tree, for n > 0, a node 
at depth n is chosen with probability (1 — exp(— (a/m) n )) for some a <G [0,1). 
The mean number of selected nodes at depth n in the tree is equal to to"(1 — 
exp(— (a/m)")) ~ a n and the total number of selected nodes in the whole tree 
A/"(T) is such that 

11 00 1 

< E(Af(T)) = V m"(l - e -(«/'™)") < 




I- a 2(1 - a 2 /to) - v v " ^ y '-\-a 

K 1 ' n=0 

in particular the mean number of selected nodes A/"(T) ~ 1/(1 — a) when a —> 1. 
The behavior of the size R(a) of the sampled tree is used to estimate the speed 
of the exploration process, when the number of selected nodes becomes large. We 
first give the expression of the mean value E(R(a)) of the size of the sampled tree. 

Lemma 2. The mean value of the size of the sampled tree in the depth biased model 
is given by 

(29) E(R(a)) = J^to™ ^1 - E ^cxp [ - 

Proof. As in the previous section, for n > and 1 < £ < Z n , the symbol T n ' e 
denotes the sub-tree of T whose root is (n, I). The node (n,£) is in the sampled 
tree if A/"(T"' £ ) ^ 0. Since nodes at a given depth are selected independently one 
of each other, we have 

(oo r?(n,l) \ 

-(£) xy-^H' 

where zf 1 is the number of descendants of {n, £) at generation i and where we 
have used the fact that the sub-tree T™^ has the same offspring distribution as the 
original tree T. Hence, 

P (Af{T n - e ) £ | (n, £) G T) = 1 - E ^exp 

It follows that the size of the sampled tree given by 

-{-oo Z n 

(so) %) = EEw f ^o } 

n=0 l=\ 

and its mean value is, by using the independence in the selection of nodes, 

+oo 

E(R(a)) = J2 nZn)^ (N(T n < 1 ) £ 0) , 
Equation ^ follows. □ 
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The growth rate of the exploration process is defined by the ratio 
E(fl(a)) 



^£ (i -«>4-44^t^))) 



ECAf(T)) ri(c 
where 77(a) = (1 - a)E(jV(T)) — > 1 when a — * 1. 

Theorem 2. 7/E(G 2 ) < +oo, as a / 1, t/ie following limit relation holds 

a" 1 E(AA(T)) 2 
Proof. Let us first introduce the function 

+ 00 / / / TTr 



fl-(a) = ^(1 - a)m n ( 1 - E I exp I - ( 



\mJ 1 — a 



where W is defined by Equation 0. We have 
E(R(a)) 



(31) 
(32) 



H(a) — rj(a) 



E(7V(T)) 



n=0 



oo 

8=0 

< Var(W) ^ 



i=0 



Zi 

— -W 



o 

fm 



V&r(W) 
1--S-' 



where we have used Inequality ()13[) in the last step. 

Let us define the family of non-negative random variables Tl a , < a < 1 by 



+oo 



7i„ = ^(1 - a) 2 ?n™ 1 - exp ( - 



n=0 



n ^ 



to/ I — a 



We have (1 - a)ff(a) = E(W a ). 

Let us fix some e > 0. Since > a.s., we can define the quantity 

W 



n(W, a) = max 



(e(l-a)) 



.0 



< £. 



For n > n(W, a), 

VtoJ r 

By using the fact that for x > 0, 1 — e~ x > x — x 2 /2, we have 

w Q > 2 (1 " a)2m "( 1 " exp ("(£)"r^)) >« n(w;a) w(i-e). 



n—n{W,a 



Since the above inequality is valid for all e > and q"' b ' , °' converges to 1 as a /* 1, 
it follows that liminf a ^i 7i a > W a.s. 
Since 1 — e~ x < x for x > 0, we have 



H Q < W a.s. 
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and then limsupQ,^ H a < W a.s. Hence, limsup Q _^ 1 H a = W a.s. Since the family 
(Ti-a) is non negative and bounded by W, which is integrable, we have 

lim E(«c) = E(W) = 1 

a— >1 

and the result follows by using Inequality (f5!?)) . □ 

When a < 1 the selected node are closed to the root and only a small fraction 
of the whole is discovered. When a /* 1, we can select nodes deeper in the tree 
but roughly only one node is selected in average at each level. The above result 
indicates that the average size of the discovered tree grows as the square of the 
average number of selected nodes. 
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